Problem Note 62074: Jobs are not load-balanced on available nodes, and you receive a ". . . load balancing object spawner timed out" error
Load balancing does not occur, and jobs are not sent to all the available nodes. You see the following error in the object spawner log:
ERROR: The application could not log on to the server “server-address.” The load balancing object spawner timed out.
Workaround
For integration technologies, check whether the object spawners can communicate with each other successfully. In this type of distribution, typically a start-up cost is specified. That cost must be met before other servers in the cluster are used. Set the start-up cost to a value of 0 if you would like to see a round-robin distribution on the nodes. The Startup cost field is located on the Advanced Options > Performance tab for your workspace server property settings. To activate this change, you must take one of these actions:
- Restart the object spawners.
- If you do not restart them, open SAS® Management Console and select Refresh Spawner in the pop-up menu for each object spawner machine definition in the Server Manager.
In a SAS® Grid environment, check the UJOB_LIMIT parameter in your lsb.resources file. The Load Sharing Facility (LSF) lsb.queues UJOB_LIMIT parameter sets the per-user job-slot limit for the queue. Make sure that each user has the maximum number of job slots for a queue.
This issue is fixed in SAS® 9.4M5.
Operating System and Release Information
SAS System | Platform LSF | Solaris for x64 | 10.1 | 10.1 | 9.4 TS1M5 | 9.4 TS1M5 |
Linux for x64 | 10.1 | 10.1 | 9.4 TS1M5 | 9.4 TS1M5 |
HP-UX IPF | 10.1 | 10.1 | 9.4 TS1M5 | 9.4 TS1M5 |
64-bit Enabled Solaris | 10.1 | 10.1 | 9.4 TS1M5 | 9.4 TS1M5 |
64-bit Enabled AIX | 10.1 | 10.1 | 9.4 TS1M5 | 9.4 TS1M5 |
*
For software releases that are not yet generally available, the Fixed
Release is the software release in which the problem is planned to be
fixed.
Most jobs are directed to only one particular node, and the rest of the jobs remain idle. You see the following error: "ERROR: The application could not log on to the server '
server-address.' The load balancing object spawner timed out."
Type: | Problem Note |
Priority: | medium |
Date Modified: | 2018-09-27 16:29:04 |
Date Created: | 2018-04-03 11:19:26 |